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Applications of a Theorem on Partitioned Matrices 

Emilie V. Haynsworth 

(May 25, 1959) 

A reduction formula for partitioned matrices is applied to block-stochastic matrices 
and other types of partitioned matrices in order to reduce the computation in finding the 
eigenvalues. Such matrices occur frequently in physics and chemistry, and the reduction 
formulas given here have been applied successfully in practical problems. In addition, 
some results of A. Brauer on stochastic matrices and of J. Williamson in partitioned matrices 
are generalized. 



1. Introduction 

In a previous paper, 1 a general reduction formula 
was given for* certain partitioned matrices, 
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A u ^ 

A. 



(1) 



<*A n A t2 . . . A tt j 

of order N, where the submatrices, A ih have di- 
mensions UiXrij, i,j=l,2,. . . t, and 



i = l 

One of the special cases of this general theorem 
is of particular interest in practical applications to 
the problem of finding the eigenvalues of a matrix 
and, in fact, has been applied successfully to such 
problems. This special case is given below as the- 
orem 1, since its proof is simpler than that for the 
general theorem and exhibits the transformation 
matrices needed for the reduction formula. 

This result is used in section 3 to give generaliza- 
tions of some theorems by A. Brauer 2 on stochastic 
matrices. Other applications are given in later 
sections. 

As in the previous paper, for a given partitioning 
of a matrix A we shall call the subma trices, A tj , 
the blocks of A and we shall write A=(Aij). The 
" elements of the blocks will be denoted by a ( ll\i.e., 

A u =(a<<{>). 

Unless otherwise stated, the matrices will be 

1 E. Haynsworth, Reduction formulas for partitioned matrices (to be 
published). 

2 A. Brauer, Limits for the characteristic roots of a matrix, IV, Duke Math. 
I J. 19, 75(1952). 
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arbitrary complex matrices. Also, since we will 
be dealing throughout with matrices A=(A i3 ) of 
form (1), we will assume, unless otherwise indicated, 
that the statements and formulas given are true 
fori,j=l,2, . . .,t. 

2. Special Reduction Formula 

For the sake of completeness we include a lemma 
from the previous paper x which is needed in the 
proofs of theorems 1 and 3. In this lemma we 
consider three cases where there are zeros in con- 
venient places in the blocks of a partitioned matrix, 
A. In each of these cases A is reducible, and the 
proof consists merely in defining the permutation 
matrix which puts A into the reduced form, 



\D T) 



(2) 



where C and T are square matrices and is a matrix 
composed entirely of zeros. 

Lemma: Given a partitioned matrix A of order N 
with n^Xnj blocks A tj : 

1. If rii=n (i=l,2, . . ., t) and the blocks A fJ 
are lower triangular with elements \ { £ , h= 1 ,2, . . ., n, 
on the diagonal, A is similar to a matrix A, with 
blocks A hh =(X < i) ) ), h=l,2, . . ., n, on the diagonal, 
and zero blocks above the diagonal (this case is given 
by Williamson 3 ) . 

2-// 



(C t , 0\ 

\Uij 1 ijj 



(3) 



where all matrices T^ are square, of order r, and the 
matrices dj are (jti— r)X (jij— r), then A has the form 
(2), where C={C i3 ) and T=(T iJ ). 

3. If Aij has the form (3) where the matrices T tj 
are lower triangular, we will say A tj is partially trian- 
gular. Then if A has blocks A {j which are all par- 
tially trianqular with submatrices, T tj of order r, having 
elements tffi, h=l,2, . . . , r, on the diagonal, tr 
roots of A are roots of the r matrices, (tffi). 

3 J. Williamson, The latent roots of a matrix of special typo, Bull. Am. Math. 
Soc. 37, 585 (1931). 



r>1!>s:;ri :■;. 
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The proofs given below would follow in a corre- 
sponding manner if all blocks were transposed. 

Proof: 

1. The rows and columns of A should be arranged 
in the order 



1,71+1,2^+1, 

2, n+2, 2ti+2, 
n, 2n, 3n, 



., (t-l)n+l; 

., (t-l)n+2; 



(4) 



tn. 



Then the new matrix A will have the matrices 
A h h=Q^if^ on the diagonal and A hk =0 for i<j, 
so its roots are the roots of the n matrices, A hk . 

2. LetS^i=-ZV*. Then Ni=n u N t =N. If we 

arrange the rows and columns of A in the following- 
order: 

1,2, .,., 7V,-r, 

JV,+1, M+2, . . ., JV 2 -r, 

iVi-t+l, iV,_i+2, • • -, JV,-r, (5) 



N t -r+l,N t -r+2,...,N t , 

we have a new matrix A in which the matrices 
dj are together in the upper left corner, and the 
matrices T (j are together in the lower right corner. 
So A will have the form (2) where C=(C«) and 

r=(r„). 

3. Case 3 follows immediately now by first apply- 
ing the permutation in 2 to the rows and columns of 
A and then applying the permutation in 1 to the 
rows and columns containing T. 

Theorem 1: Suppose the blocks A tj oj the par- 
titioned matrix given in (1) satisfy the equation 



^ijX j =X i B ij , 



(6) 



where A[\ S) is square, of order r. Then A is similar to 
the matrix, 



R= 



B *\ 

o c) 



(8) 



where B is a partitioned matrix of order tr with blocks 
B ih as defined in (6), and C has blocks, 



Of 



=w>- 



-xpixn-'Air) 



(9) 



with dimensions (n t — r)X (nj— r). (If either n t or 
%=r, the corresponding block C t j does not appear. 
By hypothesis not all n t are equal to r, or else we 
would be left with the matrix B= (J5 i; -) which would 
be similar to A.) 

Thus the roots of A are the roots of the smaller 
matrices B and C. 

Proof: Let P t be a matrix of order n t \ 



p t = 



Yd) 







(10) 



then 



Pr'= 



OT) -1 

where I k represents an identity matrix of order k. 
Since by (7) and (10) 



01) 



AijPj— 



[Aipxr+Aipxi 



Aiij) 
-^-12 



then by (6), 

/[X^-Bv] 

and, using (11) and (9), 



Aiij 
^M2 



Aij — Pi AijPj- 







(Xn-'Ar 



(12) 



So, if we let P be the direct sum of the matrices 



/'., 



where B t j is a square matrix of order r, 0<jr^n i9 

with strict inequality for at least one value of i, and 

Xi is an n t Xr matrix with a nonsingular matrix of an( * ^ et 

order r, Xi f) , in the first r rows. Let the last 

n t —r rows of X { be X^, and let 



P=2-P t =P l +P i + ■ ■ ■ +P„ 



A=P- 1 AP, 



(13) 



JA«$ AS$\ 

\AW A^j 



then A has the blocks A i} given in (12), and the 
(7) simultaneous permutation of rows and columns 
of A given by case 2 of the lemma above, will put it 
into form (8). 
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following matrix arose from a problem in molecular 
physics : 4 



3. Generalizations of Brauer Theorems on statement 2 follows from (9). For example, the 
Stochastic Matrices 

A. Brauer 2 proved a number of interesting 
theorems about generalized stochastic matrices, i.e., 
matrices A= (a^) of order n such that 

£<>«=«, (i = l,2, . ..n). (14) 

Such matrices have as one root, s, and the vector cor- 
responding to this root is 



A= 



U n =(l,l, ..., 1) 



«14«) 
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66 


ab ( 
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cd 

dc 
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cd J 
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cd 1 
dc j 



The reduction formula of theorem 1 is applied in this 

section to matrices which can be partitioned into The lines are drawn in to show how A is subdivided 

stochastic blocks, to give generalizations of Brauer's into stochastic blocks. Then by theorem 3, the roots 



results. 

If a matrix A of order N can be partitioned into 
rectangular (ntXrij) blocks, in each of which the 
row-sums are all equal, i.e., 



with 



(A=l,2, 



k=] 2 



JL-i (l Vk' — &i 



i.l*J>= 



k = l 



(A=l,2, . . . ,n t ), 



, nj) 
(15) 



of A are those of the 5X5 matrix, 



C= 



A is a block-stochastic matrix. 



Theorem 2: If A is block-stochastic, (1) t roots of A 
are roots of the matrix S= (s^), and (2) the other N—t 
are roots of the partitioned matrix of order N—t, 

CtMaW-alP), (A=2,3,. . .,%, i=2,3,. . .,%). 

(16) 

/f ei^r n t or nj is 1, the block C^ is omitted, so i 
and j do not necessarily take on all values 1 , 2, . . . , t. 
Proof: Since, by (14a) and (15) 
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ose of the 4X4 matrix, 








r a+b 
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c+d f 


e+2f+g^ 





AijU rij — SifJ nj 



(17) 



It is easy to see that C can again be divided into 
stochastic blocks where fti=2, n 2 =l, n 3 =2. So 
the five roots of C are those of the 2X2 matrix, 



statement 1 follows immediately, for by (17), if we 

set r=l in (6) the matrices B tj of theorem 1 are Ci= 

simply the elements s tj given by (15). Then, if we 

subdivide A tj as in (7), A {i $ consists of all the 

elements of the block A {i except for those in the and the 3X3 matrix 

first row and column, i.e., 



a—b 




A&» = (atiP) (A=2, . . ., n,; k=2, . . ., n,) f 



and 



a= 



r a—b c—d 

2c-2d e-g 



A\P=(a$p) 



(k=2,.. .,%). 



s, c—d f—g e—2f+g^ 



Since by (17), the matrices X t in (6) are the 
vectors U n , we have 

^1 — ±y ^-2 — U nf-l. 

Then, since U n -iA[l j) is a matrix having in each 
row the elements 



<>, 






Thus e — 2f-\-g is a root, and there are two degen- 
erate roots since the matrix C\ corresponds to the 
quadratic matrix which remains when the root 
e — 2f J rg is factored out of <7 2 . 

Corollary (Theorem 2): If A is a block-stochas- 
tic matrix with n t Xn.j blocks A tj , and P=^tj-Pi, 
0=^2- Qt where P t and Q t are 'permutation matrices of 

4 The author is grateful to J. Bradley for supplying this example. 
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order n i} any matrix PAQ is also block-stochastic and 
has t roots in common with A. 

Proof: Since the row-sums in a given block would 
be unchanged by permutations within the block, this 
follows immediately from statement 1 of theorem 2. 

Brauer (see footnote 2) proved the following 
theorems for certain special stochastic matrices: 

1. Assume that 

aij=Cj 0=2, 3, . . . n; i<j). 

Then A has the same roots as the diagonal matrix 

B=dmg (a 22 —c 2 , a 3Z —c z , . . ., a nn — c n , s). 

2. Assume 

Uij=c, j 9^ i, and a a = b 

U=3u3h • • -,Jr). 



Then (b—c) is a root of multiplicity r. 

We generalize these results in the following 
theorem for block-stochastic matrices: 

Theorem 3: Let A be a block-stochastic matrix with 
submatrices A ij =(a i Ji j k ) having constants c k above the 
diagonal in the last r columns of each block, 0<><Cmin n t , 
i.e., 

ag J) =ci ij) (k>n nj -r; h<k+n t -nj). (18) 

Then tr of the roots of A are roots of the matrices 

Q»=<P£P-<£ S ) (k=nj-r+l, . . . , nj). (19) 

(The second of Brauer's theorems comes as a special 
case of the above if we permute the rows and columns 
so that the columns j u j 2r • • ,jr are the last r columns 
of the matrix. Also it isn't necessary that all the 
nondiagonal elements of a column be equal as in 
Brauer's hypothesis.) 

Proof: By theorem 2, N-t roots of A are roots 
of C= (Cij). Here we have i, j= 1,2, . . ., t, since by 
hypothesis, 0<y<C mm n u an d thus min n^>\. 

i i 

But by (16) and (18), C tj is a partially triangular 
matrix with elements a { tk— c ( V* on the diagonal of 
the triangular portion, i.e., for k=nj—r-\-l, . . ., nj. 
Then, using case 3 of the lemma above, we can 
permute the rows and columns of A so that A is 
similar to the matrix 



\0 Qj 



when Q= (Q hk ) is a matrix of order tr having square 
blocks of order t, defined by (15), on the diagonal, 
and Q hk =Q for k<^h. 



4. Other Applications 

In this section we will mention several other types 
of partitioned matrices, A=(Aij), such that 



PAP- 



B 

o a 



(20) 



where P is the direct sum of transformation matrices 
Pi, and B=(B ij ), C=(Cij), are partitioned matrices 
which will be defined in each case. We also give 
a case where the transformation PAP~ l produces 
a real matrix from a complex one. 

Since the proof in each case is the same and 
consists merely in performing the indicated matrix 
multiplications, it will not be given in detail. 

4. 1 . Block-Circulant Matrices 

This is the case where each block An is a circulant 
matrix of order n. The result (21) is given by 
Williamson. 3 

The roots of A are roots of the n matrices of 
order t, 



(XfiJ^gaWej) fo=l,2, ..., 



n) (21) 



where e h is one of the nth roots of unity. 
4.2. Block-Singular Matrices 

If A is singular, of rank n-1, there exists a vector 
, x n ), with components not all zero, 

AX=X-0. 



X [Xi, x 2 ,. 
such that 



We may assume, without loss of generality, that 
a?i=l. Then, as in theorem 1, if X=l-\-X 2 , we 
can let 



\-x 2 u_ x ) 



(22) 



and, applying the transformation (16), we can show 
that the nonzero roots of A are roots of 



, C=(a ij —x i a ij ) (i,j=2, 



,n). 



If A has rank less than n— 1, the same type of 
transformation can be applied to the matrix C and 
this can be continued until a nonsingular matrix is 
obtained. Or, we may be able to partition A into 
blocks so that 

AijXj=Xi* (/. 

Then if we let P=^,Pi, where each P t has the 
form (22), and apply transformation (20), we show 
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that the roots of A are the roots of 

B=(B tJ ) = (0) of order t 
and 

C=(C ij ) 
where 

(k=2, . . . , n t ; k=2, . . . , n,). 

4.3. Block Tridiagonal Matrices 
Let 



T(a) = 
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1 
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If, in addition, the matrix (a {j ) is stochastic, with 
row sum s, by theorem 1 , A has as n of its roots the 
number s+ (l/n-\-l)2t cos kw (k= 1 , . . ., n) . The other 
n(t—l) roots are degenerate with multiplicity n, 
since the matrices C tj in (5) become, by (22) for any 
value of n, 

(flv-(h,) (i, i=2, ...*). (27) 

4.4. Complex Matrices Similar to Real Matrices 

A=(A {J ) where each block is a 2X2 matrix defined 
as follows: 



(23) 



/a t +V-l6z \ 

Au=[ 

\ 0< -V-l6f/ 



(i=l, . . .,fc). 



(We use V — 1 instead of / to avoid confusion with 
the subscript.) 



Aij- 



a i3 b u 



A r - 



It is well known 5 that such matrices have charac- 
teristic roots which may be written in the form 

\ h =a+2 cos (hcjn+1) (k=l, 2, . . ., ri) (24) 

and, if we let B k =kirln-\-\ il is easy to show that the 
characteristic vector corresponding to X A: has com- A t j= 
poncnts 

(sin 6 k , sin 26 k , . . ., sin n6 k ) (25) 

since the trigonometric identities 

a sin ftk+sin 20*= (a +2 cos B k ) sin d k , 
sin (m—l)6 k -\-a sin m^+sin (7/1 + 1) k 

= (a+2 cos 6 k ) sin md k (m=2, . . ., n—1), 
sin (n—1) 6 k +a sin n$ k =(a-{-2 cos 0*) sin nO k 
hold for each fc as defined above. 

So if A=(Aij) where the blocks A tJ are all tri- 
diagonal, 

A.ij= 1 {dij) } 



dij 



bij 
ha 



fa u 



\bij b (j 



(V = l,2, . . .,k; i±j) 
(i=l,2,...,k; j=k+l,...,t) 
(i=k+l, . . ., t\ j=l, . . ., k) 



and Aij an arbitrary real matrix (i,j=k+l, . . .,£). 
Then, 



r i+V-i 1— V— i " 

2 2 

1-V 3 ^ l+V 11 ! 



(1=1,2,...,*), 



2 2 

Pi=I 2 (i=k+l,. ...n). 
-b y 



(a -b\ 
\b a) 



(i=l, . . ., fc), 



these blocks can be simultaneously reduced to p A P~ 1 —A 
triangular form using the matrix of characteristic 

iiA i j=A i j 



vectors (25), since these are independent of the 
elements a tj . Thus, according to a theorem by 
Williamson, 1 the roots of A are those of the n 
matrices, 

(dij+2 cos (ir/n+1)), . . ., 

(a ti +2 cos (nwln+1)). (26) 



Since 



(i,i=l, . . .,&;), 

(i=l, . . ., k, j=k=l, . . ., 0, 



5 D. E. Rutherford, Some continuant determinants arising in physics and 
chemistry, 11, Proc. Hoy. Soc. Edinburgh 68, III (1952). 



A=PAP~ l 



(28) 
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has the same elements as A except in the 2X2 blocks 
on the diagonal for i=l, 2, . . ., k, and the elements 
in these blocks are real. (Matrices of this type also 
arose from an applied problem at the National 
Bureau of Standards.) 

5. Applications to Determinants and 
Systems of Equations 

The transformations given here for a partitioned 
matrix A= (Ay) can obviously be applied to finding 
the determinant of such a matrix since if 



A=PAP~ 



D 

a 



(29) 



\A\ = \A\ = \B\.\C\. 



These transformations could also be profitably 
applied to the solution of a system of linear equations 
with coefficient matrix A; i.e., the system of equa- 
tions 

AX=K (30) 

is, by (29), the same as 



Then if X=X l + X 2 and K=K l +K 2 are partitioned 
to correspond with the dimensions of B and C, we 
can first solve the system 



CX 2 =K 2 



and then the system 



BX X =K X -DX 2 . 



(32) 



(33) 



So, solving these two systems of linear equations 
and the comparatively simple system (31) (which 
can be partitioned as P is partitioned), we have the 
solution of (29) . 

Such a transformation would be especially valuable 
in solving a large system on an electronic computer, 
where the length of time for solution goes up on 
the order of n 3 . Thus, if we cut the dimensions in 
half by transformation (29), systems (32) and (33) 
would each take one-eighth as long to solve as the 
original system, and if we assumed only that P 
could be partitioned into two parts the total solution 
time would be at most one-half that of the original 
system. In many cases the time could be cut to 
approximately one-fourth or less. 



or 


(P~ 1 AP)X=K 




AX=K 


where 






pk=k 


and 






PX=X. 



(31) Washington", D.C. 



(Paper 63B2-9) 
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